Claims

What is claimed is: 

1. An information retrieval system, comprising:

a hierarchal analysis component that receives a query and processes
probabilities associated with N categories, each category having one or more topics, N 
being an integer; and 

an interactive component that provides feedback derived from the query and 
the probabilities associated with the N categories and the one or more topics, the 
feedback being utilized to determine at least one category of the N categories to 
facilitate retrieval of at least one of the one or more topics. 

2. The system of claim 1, further comprising an automatic classifier construction
component that builds a top-level classifier for the N categories and a sublevel
classifier for each category of the one or more topics associated with the N categories.

3. The system of claim 2, wherein the classifiers are provided by at least one of a
Support Vector Machine, Naive Bayes, Bayes Net, decision tree, similarity-based,
vector-based and a Bayesian-based classification model.

4. The system of claim 3, wherein the automatic classifier construction
component employs a learning model to build the classifiers.

5. The system of claim 4, wherein the learning model is associated with a 
Support Vector Machine and employs Sequential Minimal Optimization (SMO) to 
train the classifiers.

6. The system of claim 4, further comprising a data structure that includes a
mapping of I possible queries and one or more associated topics, I being an integer, to
enable learning for the classifiers.

7. The system of claim 6, wherein the data structure is updated via at least one of 
implicit and explicit user actions associated with a query to facilitate improved 
learning models. 

8. The system of claim 6, wherein the data structure is centrally located to enable 
monitoring of implicit and explicit user actions associated with queries from a 
plurality of users to facilitate improved learning models.

9. The system of claim 2, wherein the first classifier is employed to drive the
sublevel classifiers at run time to form a hierarchical classification structure.

10. The system of claim 9, wherein the query and the first classifier are employed
to determine the most likely of the N categories.

11. The system of claim 10, further comprising a context disambiguation
component that utilizes the query and the first classifier to determine the feedback.

12. The system of claim 11, wherein the context disambiguation component
utilizes the query and the feedback to drive the sublevel classifiers in order to 
determine a desired topic. 

13. The system of claim 11, wherein the context disambiguation component 
further comprises a presentation component for interfacing to a user and an analytical 
component to facilitate feedback and decision-making related to the feedback.

14. The system of claim 13, wherein the analytical component includes a
cost-benefit analysis considering the cost of the dialog with the information value of the
dialog.

15. The system of claim 13, wherein the analytical component includes a
decision analysis for determining the nature and quantity of a clarification dialog.

16. The system of claim 13, wherein the analytical component includes a
computation of the value of information associated with feedback gained during a 
clarification dialog to guide the nature and quantity of the clarification dialog. 

17. The system of claim 13, wherein the analytical component employs at least 
one of a rule-based policy and an expected utility policy that controls if and how 
dialog is invoked based on the distribution of probabilities assigned to topics at one or
more layers of a classification scheme. 

18. The system of claim 17, wherein the analytical component analyzes
probabilistic weights associated with each category and related subtopic for
determining feedback and presentation to the user. 

19. The system of claim 17, wherein the analytical component analyzes 
probabilistic weights as a spread across each category and related subtopic for
determining feedback and presentation to the user. 

20. The system of claim 13, wherein the presentation component includes a 
ranked display of most likely N categories. 

21. The system of claim 20, wherein at least one of the most likely N categories is
selected to provide a ranked display of one or more topics.

22. The system of claim 1, wherein information is retrieved as part of a help
system.

23. The system of claim 1, wherein information is retrieved from a network-based
system.

24. The system of claim 1, wherein the probabilities are determined via a hand- 
crafted analysis. 

25. The system of claim 1, further comprising L levels of N categories, each
category having one or more topics, wherein L and N are integers. 

26. A computer-readable medium storing the computer-executable components of
claim 1.

27. A method providing information retrieval, comprising:

determining probabilities associated with one or more categories associated
with one or more associated topics;

providing feedback that is derived from a query and the probabilities
associated with the one or more categories and the one or more associated topics; and

resolving at least one category of the one or more categories based upon the
feedback to facilitate retrieval of at least one of the one or more associated topics.

28. The method of claim 27, further comprising,

building a top-level classifier for the N categories and a sublevel classifier for
each category of the one or more topics associated with the N categories.

29. The method of claim 28, wherein the classifiers are at least one of a vector-
based and a Bayesian-based model.

30. The method of claim 29, further comprising,

mapping I possible queries and associated topics within a data structure, I
being an integer, to enable learning of the classifiers.

31. The method of claim 30, further comprising,

monitoring implicit and explicit user actions associated with a query to 
facilitate improved learning models. 

32. The method of claim 30, further comprising, 

monitoring a central data location for implicit and explicit user actions 
associated with queries from a plurality of users to facilitate improved learning 
models.

33. The method of claim 28, wherein the top-level classifier is employed to drive 
the sublevel classifiers at run time to form a hierarchical classification structure. 

34. The method of claim 33, wherein the query and the top-level classifier are
employed to determine the most likely of the N categories.

35. The method of claim 34, further comprising,

utilizing the query and the top-level classifier to determine the feedback.

36. The method of claim 35, further comprising,

utilizing the query and the feedback to drive the sublevel classifiers in order
to determine a desired topic.



MS 150905.1 



37. The method of claim 27, further comprising,

utilizing at least one of a cost benefit analysis and a decision analysis for
determining the feedback.

38. The method of claim 35, further comprising,

utilizing rule-based policies and expected-utility policies for establishing
probabilistic thresholds associated with the feedback.

39. A system providing information retrieval, comprising:

means for determining probabilities associated with N categories, each
category having one or more topics, N being an integer;

means for providing feedback that is derived from a query and the
probabilities associated with the N categories and the one or more topics; and

means for determining at least one category of the N categories based upon the
feedback to facilitate retrieval of at least one of the one or more topics.

40. A computer-readable medium having stored thereon a data structure,
comprising:

a plurality of query fields representing data associated with previous user
interactions with a database;

a plurality of topic fields representing possible topics associated with the
query fields; and

at least one category field derived from the plurality of query and topic fields
to enable information retrieval from the topics.

41. A signal adapted to be transmitted between at least two processes, comprising:

an analysis component that receives a query via the signal and processes 
probabilities associated with N categories, each category having one or more topics, N 
being an integer; and 

an interactive component that provides feedback via the signal derived from 
the query and the probabilities associated with the N categories and the one or more 
topics, the feedback being utilized to determine at least one category of the N 
categories to facilitate retrieval of at least one of the one or more topics. 
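
For illustration only, and not as part of the claims, the flow recited in claims 1, 9 to 12 and 17 might be sketched in Python as follows. Every name here (retrieve_topic, needs_clarification, the toy classifiers, the 0.6 threshold) is a hypothetical chosen for the example; the claims do not prescribe any particular implementation, and the toy probabilities are hand-crafted stand-ins for trained classifiers.

```python
from typing import Callable, Dict, List

Probs = Dict[str, float]
Classifier = Callable[[str], Probs]


def rank(probs: Probs) -> List[str]:
    """Return labels ordered from most to least probable."""
    return sorted(probs, key=probs.get, reverse=True)


def needs_clarification(probs: Probs, threshold: float = 0.6) -> bool:
    """Hypothetical rule-based policy: ask the user when no category
    reaches the probability threshold."""
    return max(probs.values()) < threshold


def retrieve_topic(
    query: str,
    top_level: Classifier,
    sublevel: Dict[str, Classifier],
    ask_user: Callable[[List[str]], str],
    threshold: float = 0.6,
) -> str:
    """Pick a category (with feedback if needed), then the best topic in it."""
    category_probs = top_level(query)
    ranked = rank(category_probs)
    if needs_clarification(category_probs, threshold):
        # Feedback step: show the ranked categories and let the user choose.
        category = ask_user(ranked)
    else:
        category = ranked[0]
    # The chosen category drives the matching sublevel classifier.
    topic_probs = sublevel[category](query)
    return rank(topic_probs)[0]


# Toy, hand-crafted classifiers (assumptions made for this example only).
def toy_top_level(query: str) -> Probs:
    if "print" in query:
        return {"Printing": 0.9, "Networking": 0.1}
    return {"Printing": 0.5, "Networking": 0.5}


toy_sublevel: Dict[str, Classifier] = {
    "Printing": lambda q: {"Add a printer": 0.7, "Clear the print queue": 0.3},
    "Networking": lambda q: {"Share a folder": 0.8, "Map a drive": 0.2},
}
```

In this sketch a confident top-level result skips the dialog entirely, while an ambiguous one hands the ranked category list to ask_user, whose answer selects the sublevel classifier to run.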